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Abstract 

Given positive integers n and m, and a probability measure P on 
{0, 1, . . . , m} the random intersection graph G{n, m, P) on vertex set 
V = {l,2,...,n} and with attribute set W = {wi,W2t ■ ■ ■ ,Wm} is 
defined as foUows. Let Si, 82, ■ ■ ■ , Sn be independent random subsets 
of W such that for any v ^ V and any S C W we have ¥{Sv = 
S) = F(|S'|)/(|™|). The edge set of G{n,m,P) consists of those pairs 
{u, v}CV for which SuDSy^ 0. 

We study the asymptotic order of the chque number uj{G{n, m, P)) 
in random intersection graphs with bounded expected degrees. For 
instance, in the case m = Q{n) we show that if the vertex degree 
distribution is power-law with exponent a € (1; 2), then the maximum 
clique is of a polynomial size, while if the variance of the degrees is 
bounded, then the maximum clique has (1 + o(l)) vertices whp. 

In each case we give a polynomial algorithm which finds a clique of 
size w(G(n, TO, P))(l — o(l)) whp. One of ingredients of our proofs is 
a result of Alon et al [I] in Ramsey theory. 
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1 Introduction 



Bianconi and Marsili observed in 2006 that empirical 'scale-free' networks can have 
very large cliques; they gave an argument suggesting that the rate of divergence 
is polynomial if the degree variance is unbounded In a more precise analysis, 
Janson, Luczak and Norros [S] gave exact asymptotics for the clique number in a 
power-law random graph model where edge probabilities are proportional to the 
product of weights of their endpoints. However, with conditionally independent 
edges, the random graphs of ^ do not have the clustering property which may be 
important in explaining the presence of large cliques in real- world networks. 

The random intersection graph model was introduced by Karohski, Schein- 
erman and Singer-Cohen in 1999 [IX and further generalised by Godehardt and 
Jaworski [8] and others. With appropriate parameters, it yields graphs that are 
sparse J5] , have a positive clustering coefficient O H] and assortativity f4l . In the 
introductory paper [llj . thresholds for small (constant-sized) cliques were deter- 
mined; the asymptotic distribution of the number of small cliques was studied in 
[14| . We provide asymptotics for the size of the largest clique in random intersection 
graphs with linear number of edges. 

For a sequence (G(n), n = 1, 2, . . . ) of random intersection graphs with G{n) = 
G{n,'m, P) and m = m{n), P — P{n) we let X{n) denote a random variable 
distributed according to P{n) and define Y{n) := yj^X{n). If not stated otherwise 
the limits below will be taken as n — >■ cxd. In this paper we use the standard notation 
o(), 0(), r2(), 9(), op(), Op(), see, for example, [TU]. We will consider sequences 
(G(n)) for which 

EY{n)^0{l). (1) 

This condition ensures that the resulting graphs are sparse, that is, the expected 
number of edges in G(n) is 0{n). 

A function L : R_|_ — ^ R+ is called slowly varying if Imix^oc L(tx) / L(x) = 1 
for any t > 0. For positive sequences (a„), (&„) we write a„ ^ 6„ if a.n/bn — > 1 
as n — >■ oo. We say that (G(n)) is power-law with index a if there are positive 
numbers a, eo and a slowly varying function L such that for each sequence Xn with 
^i/2-eo <Xn< n^/'^+^o have 

¥{Y{n)>x,,)^L{x^)x-''. (2) 

Our first theorem shows that when a G (1; 2) the order of the maximum clique 
in G(n, m, P) is as in the considerably simpler model of W - 

Theorem 1.1 Let {G{n), n = 1,2, . . .) , G{n) = G{n, m{n),P(n)) he a sequence of 
random power-law intersection graphs with index a G (1; 2). Suppose that Ey(n) — 
0(1) and there is (3 > max(2 — a, a — 1) such that m — m{n) — Q,{n^). Let L he a 
slowly varying function given in Then the clique numher of G{n) is 

w{G{n)) = K{n){l op{l)) 

where 

K{n) = (1 ~ a/2)""/^ L ((nlnn)!/^) n^~"''^{\nn)-'''^ . 
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The set of edges of a random intersection graph G ~ G{n, m, P) with attribute 
set is a union of monochromatic cliques T(w) — {v ^ [n] : w £ S^} ■ We denote 
the size of a largest monochromatic clique by ui'{G). Clearly, the clique number of 
G is at least ui'{G). 

Our second theorem shows that the largest clique in a sparse random intersec- 
tion graph with variance of Y bounded is a monochromatic clique (plus possibly a 
few extra vertices). Denote xW y — max{x,y). 

Theorem 1.2 Let {G{n),n = 1,2,...), G{n) — G{n,m{n), P{n)) be a sequence 
of random intersection graphs. Suppose that ¥,Y{n) = 0{1), m{n) oo and 
Var{Y{n)) = 0(1). Then 

E {Lj{Gin)) ~ Lj'{G{n))f = 0(1). 
If, in addition, there is a positive sequence (e„) such that e„ — > and 

n¥{Y{n) > e™^^/^) ^ q (3) 

then 

P iuj{G{n)) < C V {uj'{G{n)) + 3)) ^ 1 
for an absolute constant C. 

Let us recall the maximum load problem. Consider the balls into bins model 
where each of N balls is thrown into one of m bins uniformly and independently at 
random. Let AI{N,m) be the maximum number of balls contained in any of the 
bins. The asymptotics of M{N,m) are well known, see, e.g.. Section 6 of Kolchin 
et al [12]. 

Denote by dxv (X, Y) the total variation distance between random variables X 
and Y (that is, dTviX,Y) = sup |P(X e A) - P{Y e A)\ over ah Borel sets A). 
Our next result supplements Theorem ll.2l bv providing asymptotics for the size of 
a largest monochromatic clique. 

Theorem 1.3 Let {G{n),n — 1,2...), G{n) — G{n,m{n), P{n)) be a sequence of 
random intersection graphs. Let Y = Y(n) be defined as in Suppos^ that 

EF = 9(1), Var{Y) = 0(1) and m{n) oo. Then 

dTv{i^'{G{n)), M{lEY{mn)^^^ \,m)) 0. 

For example, if m{n) = 8(71), Theorem 11.21 and Theorem 11.31 together with the 
asymptotics for M (Theorem II. 6.1 of [12 ), imply that 

In Ti 

u^{G{n)) = -—{l + op{l)). 
in mn 



small technical remark: Theorem 11.31 does not make the assumption ((SJ, but asks 
that Ey is bounded away from zero. 
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In contrast, for a constant c > 0, the Erdos-Renyi random graph has uj{G(n, c/n)) < 
3 whp and in the model of with corresponding parameters the maximum chque 
has at most 4 vertices. 

The condition ([3]) is not very restrictive. It can be replaced by a stronger 
requirement that both a) Y converges in distribution to a random variable Y* with 
Var{Y*) < oo, and b) Var{Y) converges to Var{Y*), see, e.g., [5]. 

Each of our main results (Theorem 11.11 and Theorem II. 2p have corresponding 
simple polynomial algorithms that construct a clique of the optimal order whp. For 
power-law random intersection graphs with a G (1;2), we may use a greedy algo- 
rithm considered in [5]. Sort the vertices of a graph in descending order according 
to their degree. Traverse vertices in that order and 'grow' a clique, by adding a 
vertex if it is connected to each vertex in the current clique. 

For random intersection graphs corresponding to Theorem 11.21 we propose an- 
other simple algorithm based on the fact that the largest clique whp contains two 
vertices which share only one attribute. For each pair of connected vertices, take 
any maximal clique formed by that pair and their common neighbours. Output the 
biggest maximal clique found in this way. More details and analysis of each of the 
algorithms are given in Section |4] below. 

An empirical graph may be assumed to be distributed as a random intersection 
graph, but information about the subset size distribution may not be available. 
Suppose (G(n), n = 1,2,...) is a sequence of random intersection graphs. Instead 
of the condition ([2]) for the tails of the normalised subset sizes, Y{n), we may 
consider a similar condition for the degree Di(n) = (iG'(„)(l) of vertex 1 in G{n): 
there are constants a' > 1, e' > and a slowly varying function L'{x) such that for 
any sequence t„ with n^^^^^ < t„ < n^/^+^ 

P(Z?i(n)>i„)^i'(t„K;"'. (4) 

The following lemma shows that, subject to an additional assumption, there is 
equivalence between these two conditions. 

Lemma 1.4 Let {G{n),n — 1,2,...), G{n) — G{n,m{n), P{n)) be a sequence of 
random intersection graphs, let Di ~ Di{n) be the degree of vertex 1 in G{n) and 
let Y — Y{n) be defined as in (Qp. Assume there is e > such that 

Eri^>„i/2-. -> 0. (5) 

Suppose either (Ey)^ or EDi converges to a positive number. 

Then both limits exist and are equal. Furthermore, the condition ^ holds if and 
only if 0j holds. In that case, a' — a and L'{t) ~ d"^'^L{t), where d = limEZ3i = 
lim(Ey)2. 

Thus, if we make the assumption ([5]) then we can use only the information about 
the degree distribution in Theorem ll.il 

Similarly, in Theorem 1 1.2 1 the condition that EF^ is bounded may be replaced 
by a condition that the degree variance is bounded. 
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Lemma 1.5 Let {G{n),n = 1,2,...), G{n) — G {n, 171(71), P{n)) be a sequeTice of 
raTidom iTitersection graphs, let Di = Di{n) be the degree of vertex 1 iti G{n) and 
let Y = Y{77) be defined as in Assume there is a sequence e„ — ;> such that 



■Y>e„n 



1/2 -J> 



(6) 



and either EY ^ orEDi^e(l). Then 



EDi = {EYf + o{l) 
VariDi) = Er(V"ar(r) + 1) + o(l). 



(7) 
(8) 



The plan of the rest of this paper is as foUows. In Section [5] we study sparse 
random power-law intersection graphs with index a G (1; 2), introduce the result on 
"rainbow" cliques in extremal combinatorics (Lemma 12. 8p and prove Theorem 11.11 
Next, in Section |3] we relate our model to the balls and bins model and prove 
Theorem 11.21 In Section 2] we present and analyse algorithms for finding large 
cliques in G{n, m, P). Then, in Section[5]we prove Lemmas 11.41 and 11.51 Finally, in 
the last section we give some concluding remarks. 

2 Random power-law intersection graphs 
2.1 Preliminaries 

Let {G(n),ri = 1,2,...} be a sequence of power-law random intersection graphs 
with index a £ (1; 2), where G(ri) = G{n, ni, P), m = m[n) and P — P{n). Assume 
that ^ holds. Let L : i?+ — > R+ be the slowly varying function given in ([2]). 
Let ei be a small positive constant and define 



01 ^0i{n) =7n^/^n-^'; 02 = 02{n) = {{I - a/2)7n\nn + meif'^ , 
wherfiei = ei(n) = niax(0,lni((nlnn)i/2)). For w G V{G{n)) letX^(n) = \S.^{n)\ 



be the size of random subset corresponding to v. For each n define random sets 



Also write Gi = Gi{n) = G{n)[Vi] for i = 1, 2, 3. The main result follows from three 
lemmas about each of the parts Vi. Let K — K{n) be as in Theorem ll.il The first 
lemma gives a lower bound for the clique number of G{n). 

Lemma 2.1 For any m — m{n) 



Vo 
V2 



Vo{n) 
Vi{n) 
V2{n) 



{v e V{G{n)) : X, < 0i}; 

{v e V{G{nj) : 0^ < X„ < ^2}; 

{v £ V{G{7i)) : 02 < X„}. 



w(G2) = \V2\{1 - op(l)) = K{1 ~ Op(l)). 



^To understand the idea of the proof one can assume L{x) = 1 and ei = 0. 
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The next two lemmas provide the upper bound. 

Lemma 2.2 Suppose there is l3 > a — 1 such that m — il{n^). If ei < ^ then 
there is 6 > such that 

Lemma 2.3 Suppose there is j3>2~a such that m — If ei < ^^2i" then 

uj{Gi) = op(K). 

Proof of Theorem [TA] We can take ei > so that ei < min(^^, ^~^+° ). By 
LemmaO uj{G) > ^(62) = K{1 - op(l)). Since w(G) < w(G'o) + w(Gi) + lu{G2) 
we have uj{G) < K{1 + op(l)) by Lemmas [2?2] and [231 □ 

We will often use a random intersection graph G(n, m, P) with P the binomial 
distribution with parameters m and p. This model is called the random binomial 
intersection graph and denoted G{n,m,p). 

In G(n, m,p) the events w G Sy are independent for all pairs {w, w), w G v e 
= [n] , and occur with probability p. For a random variable B with distribution 
Binom{m,p) and any positive e < |, we have by a Chernoff bound (see [l3]): 

P(|B - mp| > emp) < 2e-^''™P. (9) 

Therefore if (G(n, m,p), n = 1,2,...) is a sequence of random binomial intersection 
graphs with m = m(n), p = p(n), then the sizes — Xy{n) of the random subsets 
of G{n,m,p) satisfy 

F{3v e [n] : \Xy - mp\ > y) < nF{\Xy - mp\ > y) (10) 

for any y ~ y{n) such that y/\/mplnn — > 00. 

2.2 Large sets (the graph G2) 

In this section we use ideas from [5] to give a lower bound on the clique size for a 
sequence of sparse random intersection graphs. 
We first note the following technical facts. 

Lemma 2.4 Suppose a = an,b = bn are sequences of positive reals such that < 
In 26 + 2a — > +00. Let z„ be the positive root of 

a-\nz-bz'^ = 0. (11) 

Thenz - , / 2«+i"(2&)" 

Proof Changing the variables t — 2bz^ we get 

t + \n{t) = 2a + ln(26). 

From the assumption it follows that t + \nt ^ t and therefore z„ ^ \/ ^""''i?*-^''-* . □ 
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Lemma 2.5 ([7j) For any slowly varying function L we have InL(x) = o(lna;) as 
X — ^ oo. 



Proof of Lemma [2TT] Write N = \V2\ and let 




V 



be the vertices of V2 listed in a descending order of their set sizes, i.e. \Sy(i)\ < 
for z = 2,..., TV. 

Consider a greedy algorithm for finding a clique in G proposed by Janson, 
Luczak and Norros [3] (see also Section S]). Let A° = 0. In the step i = 1,2, . . . , N 
let = A^~^ U {w'^*-'} if w*^*-' is incident to each of the vertices w^-'-' , j = 1, 1. 
Otherwise let A* = A^^^ . This algorithm produces a clique H on the set of vertices 
A^ , and H demonstrates that lo{G2)>\A^\. 

Let L be the set of vertices that failed to be added to A^ , i.e L = V2 \ A^ . We 
will show that 



Since by Markov's inequality P(|L|/A^ > < ^ (l-^j/^) jg enough to show that 



Write 9 — 92- Let pi be the probability that two random 6'-sets from W = [m] 
do not intersect. Conditionally on TV, the number of vertices in L is at most the 
number of pairs in x,y ^ V2 where Sx and Sy do not intersect. 
Therefore by the first moment method 



\L\/N^op{l) 



and 



iV = (1 - L ((nln7i)i/2) (Inn)^"/^ 



n 



^/2(l-op(l)). 



E{\L\/N) 0. 



E{\L\/N) =EE(|L|/iV|iV) < E 




N < 



ENpi 
2~ 



Now 




and N ^ Binom{n, q) where q = P(X„ > 9). Using ([2]) we have 




(l-a/2) 



L{\/ nln?i)(ln7i) 



-a/2 l-a/2 



Here we used equivalences L{{n/my^'^9) ^ L{\/n Inn) and lnL(-\/n Inn) = o(lnn) 
which follow from Lemma 12.51 
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By (fTO| we have N — EN{1 — op(l)). It is straightforward to check that for 
some constant c we have ENpi < c(lnn)~"/^ — > 0. This completes the proof. 

Let us briefly explain the intuition for the choice of 9. For simplicity assume 
L{x) = 1 so that ei = 0. Could the same method yield a bigger clique if 02 is 
smaller? It turns out that the given value for 02 is (asymptotically) optimal. It can 
be obtained by solving 

ni-"/2m"/20-"e-«'/™ = 1 

or 

a"Mnn + l/21n(m/n)-ln6' = (12) 

am 

which is of the form where a — a~^ Inn + l/21n(m/n) and b — (am)^^. 

We have that fee^" = a^^na^^ +oo and by Lemma [2^ the solution of ([T2|) 
satisfies 

/ {2/ a) In n - ln(n/m) + ln(2/am) /- -— 

^ V(l-«/2)mlnn. 

□ 



2.3 Small sets (the graph Go) 

In this section we consider the graph Go. This graph has n{l — op(l)) vertices and 
all the sets Sy are of size at most m^/^n^^^. We say that an intersection graph 
G = G{V, W) contains a rainbow Kh if there is a subgraph H C G onh vertices and 
an injective map, assigning each edge xy G E{H) an attribute Wxy G Sxf^ Sy C W. 
The next lemma follows directly from the results of 

Lemma 2.6 Let {G{n), n — 1,2, . . .), G{n) = G{n, m,p), m — m{n),p — p{n) be 
a sequence of binomial random intersection graphs, let h be a positive integer and 
suppose that pn^/^^~^^m^/'^ — > a € {0, 1}. Then 

P(G contains a rainbow Kh) — > a. 

Proof The case a = 1 follows from Claim 2 of 11 . For the case a = we have by 
the first moment method 

P(G contains a rainbow Kh) < (yj^ (™)('')?'^ 

(5) 

when n^/C'^^^TO^/^p -^0. □ 
In the next lemma we give a crude upper bound for uj'{G). 
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Lemma 2.7 Let {G{n),n = 1,2,...), G{n) = G{n, m, P), m = m{n),P = be 
a sequence of random intersection graphs such that EY{n) = 0(1). Suppose (G{n)) 
is power-law with index a £ (1; 2) and there is P > a — 1 such that m = ^l{n^). 
Then there is a constant 5 > such that u}'{G{n)) < n^-"^/^-^ whp. 

Proof Write V = V{G{n)) = [n], W = W{G{n)) = and let X = 

X{n) and Y — Y{n) be defined as in (jlj. For a fixed key w G W and a vertex 

e 5.) = V -P(|5.| = fc) = — = 

Since 

|Tu,| ^ Binom n, ) (13) 



/mn J 



we have, for any positive integer fc 



, , (n\ ( "^Y \^ ( en'&Y^" f ci ATT^ 



for ci — e sup„ E 1". Therefore 

k 

k 

Fix 5 with < (5 < min((a - 1 - ;9)/4, 1 - a/2,13/2). We have 



>(a;'(G(n))>fc)<m(^J- 



' (^'(G(n)) > < m (cin"/2-i/2+^^-i/2) 



since m — oo, t^i-^/s-"? _s> qq a^j^(j fop large enough n 

^Q/2-l/2+5y^-l/2+5//3 ^a/2-l/2+5-/J/2+5 ^ ^(q-1_^+45)/2 



□ 



The last and the most important fact we need relates the maximum clique size 
with the maximum rainbow clique size in an intersection graph. An edge-colouring 
of a graph is called f-good if each colour appears at most t times at each vertex. 
Given an edge-coloured graph G we say that G contains a rainbow copy of H if 
G contains a subgraph H isomorphic to H such that no two edges of H have the 
same colour. 

Lemma 2.8 ([1]) There is a constant c such that every t-good coloured complete 
graph on more than vertices contains a rainbow copy of Kh ■ 
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Proof of Lemma 12.21 Define p — p{n) — m ^/^n + to ^/■^ and fix an integer 
> 1 + J_. Then 

P(Go contains a rainbow Kh) — ^ 0. (14) 

Indeed, since ei < ^, by pH]) each random subset in G{n, m,p) has size at least 9i 
whp and the probability that Gq contains a rainbow Kh can be bounded by 

P(G(n,TO,p) contains a rainbow Kh) + o(l). 

Hence follows by Lemma [2.61 since ri^/(''~^'TO^/^p — > 0. 
We have 

P(w(Go) > fc) < P(w'(G(n)) > t) + P(w(Go) > k \ a;'(G(n)) < i). 

By Lemma O there is 5 > such that for t = n^-^l'^-^ 

P(a;'(G(n)) > i) ^ 0. 

By (|14p . the probability that Go contains a rainbow clique of size h conditional on 
uj'{G{n)) < t tends to 0. Therefore by Lemma the probability that Gq contains 
a rainbow clique of size at least = O (n^""/^""^) also tends to 0. [To see that 
we can apply Lemma [2.81 to an intersection graph G, colour each edge xy € E{G) 
by an arbitrary element of Sx C) Sy. If i^'{G) < t then clearly each colour appears 
at each vertex at most t times.] □ 

2.4 Intermediate sets (the graph Gi) 

We start with a version of the result of Erdos and Renyi about the maximum clique 
(see, for example, |10]). By the number of cliques of size <: in a graph G we mean 
the number of subsets S C V{G) of size t which induce a clique in G. 

Lemma 2.9 Consider a sequence of Erdos and Renyi random graphs {G{n,p) 
n = 1,2,...) with p — p{n) — 1. Let K — K{n) — and suppose r — r(n) 

satisfies r = o{K'^). 

Then there are positive 5 = 5{n) and e =^ e(n) such that 6,e ~> and for any n 
and any sequence of graphs (-R„) with V{Rn) — [n] and \E{Rn)\ < r{n) the number 
X{n) of cliques of size lK(l + S{n))\ in G{n,p) U Rn satisfies 

EX(n) < e{n). 

Proof Write h = 1 — p and pick a positive sequence S so that 5 — and In""'^ n + 
h+^ = oiS). Let a ^ \K{1 + 6)1 . We have 

lEX(n)<Qp©-<(^)%^-. 
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But 

ln(en/a) + - L^lnp = ln(en/a) + - {-h + 0{h^)) 

< Inn (^-6 + + hj'j -> -oo, 

which completes the proof. □ 

We will need the following lemma. 

Lemma 2.10 Let J- ~ {Ai, . . . , A/j} he a family of sets from {1, . . . , n}. Let S be a 
random subset of {1, . . . ,n} of size d. Lf \Ai\ — ai, . . . , \Ak\ — and X]i=i CLi l^n 
then 

P {{S n ^1, . . . , S* n Ak\ has a system of distinct representatives) (15) 

is maximised (subject to \Ai\ = a^, i = 1, . . . , fc) inhen {Ai} are independent. 

Proof Call any of (^) possible outcomes c for S a configuration and let C be the 
family of all configurations. Given a configuration c G C, colour an element x € [n] 
black if a; e c and colour it white otherwise. Given T = {Ai, . . . ,Ak} let Cdr{^) 
be the set of all configurations c where c n = {c n ^i, . . . , c n Afe} has a system 
of distinct representatives. 

Suppose is a family of sets that maximises p^ . and suppose without loss of 
generality that there is an element x G [n] and an index i e {2, . . . , fc} such that 
X £ Ai n Ai. Since l^d — ^ there is an element y in the complement of 

Now let A[ — Ai\{x}Li{y} and consider another family of sets J^' = {A[ , A2, . . . , Ak}. 

Observe that the family of configurations C = Cdr{^) \ Cdr{^') has the fol- 
lowing property: for each c G C the element x is black, the element y is white and 
it is not possible to find a set of distinct representatives for c n so that A is 
matched with an element other than x (otherwise excluding x from A and includ- 
ing y would not matter). Now, given a configuration c, let Cxy be a configuration 
with the colours of x and y swapped. Suppose c £ C. Then Cxy ^ Cdr{J') and 
there is a set of distinct representatives for sets cD {T\Ai) which does not use x. 
So Cxy G Cdh^T), because y is black in c^y and can be matched with Ai. Thus 
each 'bad' configuration c G C is compensated by a unique 'good' configuration 
Cxy S Cdr{^') \ Cdr{^)- This shows that \Cdr{F')\ > \Cdr{^)\ and completes 
the proof. □ 

Let G — G(V, W) be an intersection graph, let t be a positive integer and let R 
be any subset of edges of the complete graph We denote by Rainbow {G, R,t) 
the event that in the graph G + R there is a clique H of size t where each edge 
xy € E{H) \ R can be represented by an attribute Wxy € Sx Sy so that the 
resulting map from E{H) \ i? to is injective. For two functions f,g:N^ R+ 
we write f g ii f{n)/g{n) — >• 0. 
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Lemma 2.11 Let (G{n,'m,p),n = 1,2,...) be a sequence of random binomial in- 
tersection graphs where m ~ m{n), p = p{n). Suppose K = K{n) and r = r{n) are 
positive and K — 2e™P Inn < n/2, mp^ —)■ oo, ^ — >■ 0, Kp — > and r = o{K'^). 

Then there are positive sequences {en)i{5n) such that e,i,(5ri ^ and for any 
{R{n), n = 1, 2, . . . ) with R = R{n) C E{K,,) and \R{n)\ < r{n) 



P {Rainbow{G{n, ■m,p), R, K{1 + Sn))) < e„, n = 1, 2, . 
Proof Let x = x{n) be positive and such that 



px — 0, > and -v/ mp In n <^ x. 

mp 

Observe that such an x always exists since using the conditions of the lemma 



\/ mp In n ^ mp and \/ mp In n = p ^ \J mp^ In n <C p ^ VPK <^p ^ . 

Now consider n as fixed and let M — mp + x. For the graph G ~ G{n, m,p) define 
random sets Sy, v G V hy 



S{v) 



Sy, if <M, 

the first M elements of Sy, otherwise. 



Fix t e [K; 2K]. Let T = {ui, . . . , be any fixed subset of V{G), let Rt be the 
edges in R with both ends in T. 

For i £ {1, . . . ,t} we consider the event that the family of sets {Sy. n Sy^ \ j = 
1, . . . ,i — l,VjVi ^ R} has a set of distinct representatives. Denote this event by 
AT{i) and define 

t 

At := Pi Arii). 

1=1 

We will prove below that whenever n is large enough 

P(At) < (l-(l-p)*0^'^"'^^'- (16) 

If y = 1 — (1 — p)^^ and G' — G{n,p') is the random Erdos-Renyi graph then the 
quantity on the right is the probability that the subgraph induced on T is complete 
in G" + R. Let Xt be the number of i-cliques in G" + R. 

By Lemma [231 there are positive sequences (e„), (5„) such that e„, (5„ ^ and 
for K' = j^il'p^M (1 + Sn) we have EXk' < for any positive integer n. 

Note that (fT6|) can be applied for sets T of size t = K' whenever n is large 
enough since 

21t177 21ti7? 

for some (5^ — >■ since px — )■ and mp'^ <^ mp^plnn <C Kp — !• 0. 
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So by ((T6| the expected number Yk' of K'-sets T' CV such that At' holds in 
G satisfies EYk' < EXk' < for some 0. 

Let B = B{n) be the event that each v G V satisfies ||5'„| — mp\ < x. By 
([TU)) we have that P(-B) 0. Observe that B and Rainbow{G, R, K') imply that 

Y/i" > 0. Now define e" = i-r{B) ^(-^) note that e" — > as n — > oo. Since 
Yr-' are non-negative by Markov's inequality we finally obtain: 

F{RainbowiG, R, K')) < ¥(Yk' > 0\B) + ¥{B) < j^^gy + ^(B) < 

It remains to show ((TE)) . We have 

t 

HAt) = 1[F{At{i)\At{1), . . .,At{i - 1)) . 

i=l 

Recall that T ~ {ui,...,Wf} and for i = 1, . . . , t let be the number of j G 
[i — 1] such that VjVi ^ R. Fix i £ [t]. Assume without loss of generality, that 
ViVi, . . . , ViVs^ ^ R and WjWs,+i, ■ • ■ , ViVi-i G R. 
Since tM = 0{pKm) 

Si 

X] \Sv, \ < tM < TO. 

By conditioning on sizes of sets S^^, j = l,...,Si and the size of the set Si by 
Lemma 12.101 we obtain that for large enough n 

nAT{i)\S,,=A,,...,S,^^ = 

is maximised when the sets Ai, . . . , As- are independent. 
Therefore we can remove the conditioning so that 

P{At{i)\At{1), . . .,At{z - 1)) < (1 - (1 - P)'T 

and 

P(At) < (l-(l-p)*^)^''' = (l-Cl-p)*^)^^)""""'. 

□ 

Lemma 2.12 Consider a sequence of random binomial intersection graphs {G{n, m,p), n = 
1,2,...), m — m{n), p = p{n). Assume that the following holds: 

1) np = 0(1), 

2) Inn <C mp, 

3) mp^ — oo, 

4) m{npf < e2'"P'(l nn) , 
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5) e^P'plnn^ 0. 

Then there is a sequence (5„ — ?■ such that 



P {uj{G(n,m,p)) > 2e™P' lnn(l + 5„)) ^ 0. 

Proof Recall that denotes the set of vertices that chose the key w G W. We 
have ^ Binom{n,p) and 

P(|T^|>fc)< Q/. 

Call a key w red if |Tt„| > 3. Denote by R the set of edges generated by all the red 
keys. Then since limsupnp < oo 

^1^1 - ™£ (2) (fc)^'' - Mnpf{e"P - l)/2 = 0{m{npf). 



Let r — r(n) be such that m{np)^ <^ r <^ e^™''^(lnn)^. 

Consider n as fixed and fix a subset of attributes Wo C W and family To = 
{To{w) : w G Wo} of subsets of where each subset has size at least 3. Suppose 



E 



f\To{w)\ 



< r. 



Consider the event A{Wo,To) that in G = G{n,m,p) the set of red attributes is 
exactly Wo and T^, = To{w) for each w G Wo ■ 

The sets T^, w G W are iid with a common distribution T so that \T\ ~ 
Binom{n,p). Given that \T\ — k the set T is uniformly random from all the fc-sets 
of V. 

Conditional on the event A(W^o,7o), the sets Tz for the remaining attributes 
z e \ Wo are also iid and for k G {0, 1, 2} 

p(|T.|^.|A(H'.,r.))^e||[||, 

Furthermore, the size of the set \Tz\ given the event A{Wo,To) is stochastically 
dominated by |r| (unconditional). 

Let i?o be the set of edges covered by the sets in To and write K = 2e™P In n(l + 
5n) where (5„ obtained in Lemma l2.11l for m, n,p and r. 

Now an important observation is that 

P(cj(G) > K\A{Wo,To)) < V{Rainbow{G,Ro,K)). 

(Here the left side is equal to the probability that G — Rq contains a rainbow copy of 
H such that uj{H U Ro) > K, while the graph on the right side has (stochastically) 
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bigger sets Tq(z) and the rainbow graph H can additionaly have edges generated by 
attributes from Wq .) 
Therefore we have 

nuj{G)>K)^ PHG)>K\A{Wo,To))nA{Wo,To)) 

Wo To 

< J2 ^ ^l^(W^o,To))P(A(W^o,ro)) +P(|i?| > r) 

H'o,ro:|flo|<r 

< J2 HRainbow{G,RQ,K))¥{A{Wo,To))+TP{\R\>r) 

Wo,ro:\Ro\<r 

< max F{Rainbow{G,Ro,K)) +F{\R\ > r). 

\B.o\<r 

When n oo the second term in the last hne tends to zero by Markov's inequahty. 
The first term tends to zero by Lemma 12.111 and the proof is complete. 

□ 

Finally, we are ready to prove Lemma 12.31 

Proof of Lemma 12.31 Fix e, ec > such that e < ec < min{eo, 1 — a/2, (/3 — 2 + 
a)/4—3aei} (recall that eo is the constant from ([5])). Let ^2 = ((1 — a/2 — ec)mlnn)^ 
Let G'l be the subgraph of Gi induced by vertices v with < First note that 

D = \ViG,) \ V{G[)\ = c{e,)K{l + op(l)). (17) 

where c(ec) = (1 - a/2 - ec)""/^ - (1 - a/2)-"/2. 

Indeed, D - Binom{n,¥{X > O'^) ~ ¥{X > 6*2)), © and Lemma ED yield that 
ED ^ c{ec)K and PT|) follows by a Chernoff bound, see ([TU)) . Since c(ec) -> as 
Ec — > it suffices to show that for any Cc > 

c^(G'i) = op{K). 

We have that |F(G']^)| is stochastically dominated by Binom{n,P{X > di)) and by 
@ and a Chernoff bound whp \V{Gi)\ < ni = (1 + e)n^-°'/^+°"'K Let 

p.(l-a/2-.)V^(i^)''' 

then by (jlOp in the random binomial intersection graph G{ni, m,p) all the sets Sy, 
V € [ni] are of size at least 62 whp. Therefore 

P(a;(G'i) >x)< ¥{uj{G{ni,m,p)) > x)+o{l). 

Now by Lemma [2. 121 (we check that all of the conditions of this lemma hold) there 
is a sequence of positive numbers i5„, (5„ — >■ such that 

P {uj{G{ni,m,p)) > 2ni~"/2-^ inn(l + Sn)) ^ (18) 

which completes the proof. □ 
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3 Finite variance 



In this section we prove Theorem II .21 

3.1 Large cliques and rainbow K^s 

Let G — G{V, W) be an intersection graph of a family {Sy : v G V} of subsets of 
an attribute set W . Fix a graph H. We say that S" C ]/ is a witness of rainbow 
H in G if there is a bijection / from V{H) to S such that each edge xy S E{H) 
can be assigned a distinct element Wxy G W with Wxy S Sf(x) H 5'/(y). 

For a family C of subsets of W, indexed by elements of V there is a unique dual 
family C* of subsets of V, indexed by elements of W given by C* = {S^ : v G V} 
where — {w e W : v e Gw}- The family of sets {S^ : v £ V} defining an 
intersection graph G = G{V, W) is dual to the family of its monochromatic cliques 
{T^:we W}. 

Given a family C of subsets of V indexed by W and a graph H, we call a set 

5 ^ V SL witness of rainbow H, if S* is a witness of rainbow H in the intersection 
graph defined by the dual family C* (for example, if iJ is a complete graph, this 
means that each pair of elements in S can be assigned a different set in C that 
covers that pair). We call C a clique cover of H, if V{H) C V and for each edge 
xy € E{H) there is a set in C containing both x and y. 

The main lemma of this section is the following structural result for clique 
covers. 

Lemma 3.1 Let K andp be positive integers. There are positive integers h = h{K) 
and to = ta{K,p) such that the following holds. 

Let W be a set of .size at least t + h for some positive integer t > to — h. Every 
clique cover C — {Ci, . . . ,Gr} of the complete graph on W with maxi \Ci\ < t and 
maxi^j |Ci n Cj l < p contains at least K witnesses of rainbow if 4. 

Furthermore, we may take h{l) — 4, h{k) = 0{k^/'^) and to{k,p) = 0{p{k^^^ + 

P))- 

Proof Let h be the smallest integer such that (^) > K. Suppose that \W\ >t + h 
and C has fewer than K witnesses of rainbow K4. Note that C must not contain a 
rainbow Kh, for otherwise there are at least (^) > K witnesses of rainbow K4. By 
Lemma l2. 81 if C does not contain a rainbow Kh then there is a constant a = a{K) 
{a = where c is the constant from Lemma [2.81) such that \B\ > a\W\ for some 
set B G C. Now \B\ < t so W\B contains a set 5" of size h, say S = {xi, . . . , Xh}- 
If every pair of vertices {xi,Xj} in the set S", 1 < i < j < is covered by at least 

6 sets of C then for any 4-subset S' of S we can assign a unique set for each pair 
of elements in S' and demonstrate a rainbow K4. In that case C contains at least 
(4) ^ K witnesses of rainbow K4. 

So suppose one of the pairs of elements from S is contained in at most 5 sets 
in C and without loss of generality assume it is {xi, X2}. Let ixi,x2 be the index of 
one of those sets (at least one set must cover both xi and X2 as C is a clique cover 
of the complete graph). 
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Since the intersection of any two distinct sets in C has at most p elements, in 
order for every pair {{xi, y}, y G B} to be covered at least — sets need to be used. 

Thus we can pick a set Bi C i? of size at least '-^ where each ?/ e -Bi is represented 
by a unique set Ci^^ ^ containing both xi and y. 

Now remove from i?i the elements y such that X2 € C^^^ ^ (there are at most 5 
of them). Also remove those elements y which belong to the set Ci^_^ (there are 
at most p of them) . Call the newly formed set B'^ . Notice that 

, ,, \B\ a\W\ 
\B[\>—-b-p> - — --b-p. 
P P 

Let us now treat the indices {1, . . . , r} of the sets in C as colours. Consider the 
complete graph G on vertices B U {xi,X2}- Colour each edge xiy of G, where 
y £ B[ with the colour ixi,y Colour the edge X1X2 with ixi,x2 and for every edge 
2/1 2/2 G B[ use the colour iy^ = j corresponding to the set B (so that B is the set 

Finally let {x2, 2/2} be coloured with an arbitrary colour ix2,y2 such that C^^^ 
covers both X2 and 2/2- 

Take any element yi G B[. We claim that for any element y2 & B[\ Gi^^ , the 
set {xi, 0:2, 2/1, 2/2} witnesses rainbow K4^. Indeed, by the construction, the colour 
ixi_,x-2 of the edge X1X2 occurs only once, because B[ n C^^^^^ = 0. Similarly, 
xi,X2 ^ B, so j occurs only once. The colours of the two other edges incident to xi 
occur only once, since we removed all candidates y such that X2 S C^^^ Finally, 
'i'X2,yi 7^ ix2,y2 since we chose y2 outside Ci^^^^^ . 

How many such witnesses can we form? For any yi we could choose j-BJI — 
1^1 ^ ^i^i j,i I ^ l^il ~ P suitable 2/2- Repeating this for every yi we will produce 
every 4-set at most twice. So C contains at least 

witnesses of rainbow K4. 

Thus since C has fewer than K witnesses of rainbow K4 and \W\ > t + h it 
must be that 



1 fa\W\ 
2 



2 

- 5 - 2p I <K 



or 



Therefore we can set 
to complete the proof. 




□ 



The random power-law graph studied by Janson, Luczak and Norros [5] whp 
does not contain K4 as a subgraph if the exponent a is larger than 2. In our case 
a similar result holds for rainbow cliques. 
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Lemma 3.2 Let (G(n), n — 1,2, . . .) be a sequence of random intersection graphs 
such thatEY{nf = 0(1). Then the number R = R{n) of 4-sets S C V{G{n)) that 
witness a rainbow in G{n) satisfies 

Furthermore, if there is a sequence e„ — )■ such that nF{Y(n) > enn^/^) — >■ then 
G{n) does not contain a rainbow whp. 

The relevant probability for a fixed graph is given by 

Lemma 3.3 Let G = G{k, m, P) be a random intersection graph. Let Xi, . . . , Xk 
be the sizes of random subsets of G. For any integers xi, . . . ,Xk such that P{Xi — 
xi, . . . ,Xk = Xk) > 

P(G has a rainbow Kk\Xi = xi, . . . ,Xk — Xk) < m 2 {xiX2 ■ ■ -Xk) 

Proof By considering all injective maps assigning for each edge xy G E{Kk) an 
attribute Wxy Sx C\ Sy and using the union bound and independence, we obtain 
that the relevant conditional probability is at most 

(to) k (^l)fc-l i^2)k-l ixk)k-l ^ ixiX2 ■ . ■Xfc)^"^ 

(2) {m)k-i (m)fe_i ■ ' ■ (m)fc_i ~ mH^-^y^ 

□ 

Proof of Lemma l3.2l in G{n). Denote X^ = Xy{n) ~ \Sv(n)\ and Y — Y{n), also 
write X Ay = min(a;, y). We have, using Lemma 13.31 

:r< ('"V pi -^2X3^4 , A ^ n^EiX,X2X3Xi)^ _ (Ey2)4 



4/ V / ~ 4! m* 4! ' 

For the second part of the lemma, let b ^ b{n) ~ en\/m and let A = A{n) be the 
event that max^gy Xy < b. Then 

F{R>1)<ER1a+F{A). (19) 

By the union bound the second term is at most 

nF{X >b) = nF{Y > CnU^/'^) 0. 

The first term by Lemma 13.31 satisfies 



Ei?I^ < ( \m-''EiXiX2X3XiYlA < g 
\4 / 4!m° 

= e^(Er2)4 ^ 0. 

So P{R > 1) ^ 0. □ 

Given a random intersection graph G, we let W(G) denote its set of attributes. 
The next result shows that the structure of random intersection graphs with EY(n)'^ 
0(1) is relatively simple. 
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Lemma 3.4 Let {G{n),n = 1,2,...), G{n) — G{n,m{n), P{n)) be a sequence of 
random intersection graphs such thatEY{n)'^ = 0(1). Ifm{n) — >■ oo then whp each 
pair {w' ,w"} C W{G{n)) belongs to at most two random sets S^, v S V{G{n)). 

Equivalently, this lemma says that the intersection of any two monochromatic 
chques of G{n) is of size at most 2 whp. 

Proof For G — G{n) fix a pair of distinct elements w' , w" £ W{G) and v € V{G). 
We have 

m., , „ „^ V^^/,^ , ,x fc(fc-l) 

ny^\w" e s.) = Y.^{\s.\ = ^)—, k = —( TV 

^— m[m — 1) m[m — 1) 

< < — . 

n(m — 1) nm 

for some constant c > 0. 

By the union bound the probability that there is a pair of elements that is 
contained in at least k sets is at most 

T) -It)' 

which tends to zero when fc > 3. □ 



Proof of Theorem [TH Let R = R{n) be the number of 4-sets 5 C V{G{n)) wit- 
nessing rainbow in G{n). By Lemma l3.41 the intersection of any two monochro- 
matic cliques has at most 2 vertices whp. In that case, by Lemma 13.11 cither 
uj{G{n)) < toiR + 1, 2) or uj{G) < lo'{G) + h{R + 1). 

The proof of Lemma [3TT] shows that the functions to(fci 2) and h{k) are O(Vfc) 
as k ^ 00. Thus, 

a;(G(n)) < oj' (Gin)) + Z{n) 

where Z{n) = to{R + l,2) + h{R + l), and EZ^ = 0{ER) = 0(1) by LemmaO 
If nP{Y{n) > e„n^/^) — > for some e„ — > then by Lemma [3T2] G(n) whp does 
not contain a rainbow K4, so whp uj{G) < to{l, 2) V (w'(G) -I- 3). □ 



3.2 Monochromatic cliques and balls and bins 

In this section we find the asymptotics of uj' {G{n,m, P)). We start with a sim- 
ple lemma for approximating the asymptotic behaviour of the maximum bin load 
M(iV,m). 

Lemma 3.5 Let {N{n)) and (m(n)) be sequences of positive integers such that 
N — N{n) — > 00 and m = m{n) — > 00. Let ((Jn)? (^n) positive, Sn — >■ 0, e„ — 
o{Sn). Write M' = M'(n) = M([A^(1 e„)J ,m) and M = M(n) = M{N,m). 
Then there is a coupling between M' and M such that M < M' and 

P(A/' - (5„EAf' < M) ^ 1. 
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Proof We produce the coupling as follows. Throw [A^(l + e„)J balls into m bins. 
This gives an instance of M' . Denote by L the label of the bin with the lowest 
index realising the maximum. 

Now delete a uniformly random set of [cnA^J balls. The configuration with the 
remaining N balls gives an instance of M < M' . If M' = Mq and L ~ Lq, for some 
fixed Mq, Lq, the expected number of balls deleted from the bin L is by the union 
bound at most ^ 

e„iV^ = e„M^. 
Now applying Markov's inequality and averaging 

P(M' -M>t)< e„r^EM' 

and the result follows by putting t = (5„E Af . □ 

Proof of Theorem 11.31 Write G — G{n). The choice of random sets of sizes 
Xi,. . . ,Xn in G can be interpreted in the balls and bins setting as follows. We 
have m distinct bins and for each i we have Xi balls labelled i. For each i we choose 
a random subset of Xi bins and place one ball labelled i to each of the Xi bins. 
Then is the set of labels of the balls in the bin w. 

We now relax the requirement that a bin can contain only one ball labelled i. 
Namely, for each ball let us select one bin independently at random. Call a ball bad, 
if it landed in the same bin with another ball with the same label. By conditioning 
and the union bound, for any label i 

P(there is a bad baU labelled i) = E (^'\ -Xr < ^"^"'^"^ . 

\ 2 J mn 

Therefore, by the union bound 

F some ball is bad) = 0(to-^). (20) 

Define S'{i) as the set of bins that contain at least one ball labelled i. We have 
that |S"(i)| < Xi are independent for i e [n] and given that |S"(i)| = s each set of 
size s is equally likely (by symmetry). In particular, if M = M{n) is the maximum 
number of balls contained in any single bin, for any n and any Borel set A we have 

P(a;'(G) e A) = ¥{M e A\ no bad balls); 

and using (|^ 

dTv{.i^'{G),M)^o{l). 
Write N = N{n) = J27=i and Mq ^ Afo(n) = M{[EY{mny^^\ , m). Since 

|P(w'(G) eA)- P{Mo e A)\ < drviM, Mo) + o(l), 
to complete the proof it remains to prove the following. 

dTv{M,Mo) ^0. (21) 
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We will show this by constructing a pair of random variables M' and Afg, copies of 
M and Mq respectively, in a single probability space so that 

P(M' 7^ Af^) ^ 0. 

Let (z„) be a sequence of positive numbers, such that z„ — > oo, z„ = o((lnm)^/^) 
and denote e„ = z„n-i/2. Define iV_ = [EiV(l - e„)J, Ar+ = [¥.N{1 + e„)J and 
No = [EF(TOn)i/2j. 

For each n, define the following sequence of random variables, independent of N 
and M . Similarly as in the proof of Lemma |3.5[ throw A^_|_ balls into ni bins. Now 
successively pick one ball uniformly at random and delete it, repeat this — -/V_ 
times. Denote by Af^, k ^ N^, . . . , N+, the number of balls in a bin with maximum 
load after N+ — k steps. The sequence (AfAr_ , . . . , M^^) is non-decreasing and Mk 
is distributed as M{k,m) by symmetry. 

Now define 

Mn if iVe [iV_,iV+] 
M otherwise. 



M' = 



Note that M' is distributed as M{N,m) and 

By Chebyshev's inequality, since EY'^ = 0(1) and EF = 9(1) 

By Lemma [331 we have that P(Afjv_ ^ ATat^) — >■ as soon as e„EAfAr^ — >• 0. But 
this is true. Let Bi be the number of balls in the first bin (at step k — 0). For 
t — t{n) — I Zn we have 



E Az/at^ < t + miV+P(Si > t) 
t 

'eK 



< t + mN+ ( ' ^ I m"* 



< t + miV+ 



< t + exp |o(ln n + In m — ^ lnTO)| 

□ 



4 Algorithms for finding the largest clique 

In the analysis of algorithms in this section we assume that graphs are represented 
by the adjancency list data structure. We have already used a version of the 
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Greedy-Clique algorithm of [9] in Section 12.21 to give a lower bound for the 
maximum clique. The algorithm below sorts vertices by their degrees (instead of 
sorting by the random subset sizes). 



Greedy-Clique(G): 

Let v^^\ . . . , u*^") be V{G) sorted by their degrees, descending 

M ^ 

for i = 1 to n 

if w'*) is adjacent to each vertex in M then 
M ^ MU {wW} 
return M 



Proposition 4.1 Consider a sequence of random intersection graphs {G{n)) as in 
Theorem \l.l\ Suppose holds for some e > 0. The algorithm Greedy-Clique 
finds a clique of size uj(G{n)){l — op(l)) in time 0{n^). 

Proof (Sketch) The running time bound is obvious, we only need to check that the 
algorithm returns a clique of the correct size. Fix any positive constants a, b with 
a < 1/4, where e is given in ([2]). Let 62, ei be as in Section \2A\ Set 

T = T(n) = EY{{1 - a/2) Inn + 61)^^/^(1 + n-"). 

and 6 = 62(1 + b). Let Q = Q{n) be the set of vertices in V{G{n)) with degree 
larger than r and let R — R{n) be the set of vertices u in V{G{n)) with X{u) > 9. 

We claim that whp Q does not contain any vertex u with < 02 and whp 
R C Q. We skip the proof of this, since it consists of standard applications of the 
Chernoff bounds and is very similar to the proof of the second part of Lemma 11.41 

Finally, applying a Chernoff bound again, we can obtain that \Q\ > \R\ = (1 + 
b)-°'K{n){l + Op{l)) andbyLemma[2Tla;(G(n)[Q]) = (l + &)-"X(n)(l + Op(l)). 
Now the claim follows by Theorem 11.21 since we can make b arbitrarily small. □ 

For sequences of random intersection graphs with bounded degree variance (as 
in Theorem 11.21 and Theorem II. 3p we need a different but also extremely simple 
algorithm. 



Mono-Clique(G): 


for uv e E{G) 

c^{u,w}u(r(u)nr(w)) 

if \G\ > \M\ and C is a clique then 
M ^G 
return M 
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A slightly more robust version of Mono-Clique, which still gives the sanie 
result would be to construct a maximal clique starting from each edge (without 
backtracking). It is difficult to imagine a simpler algorithm for finding large cliques. 

Theorem 4.2 Consider a sequence of random intersection graphs {G{n)) as in 
Theorem M.SX The algorithm Mono-Clique /iwds a clique of size uj{G{n)) — Op{l) 
in expected time 0{n). 



We first need the following lemma. 



Lemma 4.3 Let (G{n)) be as in Theorem ] 1.3\ Then whp either G{n) is empty or 
there is a monochromatic clique T with \T\ = uj' {G{n)) and two vertices u,v G T 
such that u ^ V and \S{u) D S{v)\ — 1. 

Proof By Theorem 11.31 (and the coupling provided by its proof) it suffices to 
consider the N balls into m bins model, where Xi balls have label 1, X2 balls have 
label 2, etc., and N = Xi + ■ ■ ■+Xn. The balls, even those with the same label, are 
distinguishable. We denote the label of a ball b by label{b). As before, for v G [n] 
let S'{v) be the set of (indices of) bins with a ball labeled v. For j £ [m], let Zj be 
the number of balls in the bin j. Let L be the index of the (leftmost) bin containing 
maximum number of balls. 

Let A — A{n) be the event that either the bin L contains two balls with labels 
I1J2 respectively such that S'{li) n S'ih) = {L} or Zl < 1. Since by ([20]) no bin 
contains two balls with the same label whp, in order to prove the lemma, it suffices 
to show that P{A) — ^ I. To do this, we define three other events for each n 

B = B{n) := {\N -¥.N\ < eEN} 

n 

C = C{n) := {Y^Xf/N < cm^/^n-^/^} 

i=l 

. 2EN(EN + m- 1), 
D = D{n) := {J] Z| < "^-^ >), 

where c = 2KY'^ /EY, and e is any constant, e G (0; O.I). We have 

V{A) < f>{A\B, C, D) + ¥{B) + P(C') + P(Z)). 

Let us show first that the last three terms tend to 0. P(_B) — >■ by above. 
Now consider the event C. Again, by (|22]) 

P(7V < 2-^m^/'^n^/'^EY) = P(iV < O.SEiV) 0. 

Also, by Markov's inequality 

nyX^ > n'^^mEY^) < , = n-'^\ 

' ' - n^/^mEY^ 

1=1 
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So combining the last two bounds we get 

n 
i=l 

To see that P{D) — > 0, note that since 

P{D) < P{D\B)+F{B). 

it suffices to show that 

F{D\N ^ No)^0 

for each sequence A'o — No{n) with \No — E < eE A^. A simple calculation yields 
that 

E(E^') = ^^^^H^^ (23) 

j=i 

therefore by Chebyshev's inequality, since iVo — > oo, we have 

((i-46)E 

It remains to prove that 

P{A\B,C,D) ^ 0. (24) 

For a fixed n and vectors x S {0, 1, . . . , m}" and z E {0, 1, . . . , mn}™', such that 
i=i — Z-^j=i '^^ dehne events 

E{x, z) {Xy — Xy, for each v G [n] and Zj — zj for each j e [m]}. 
To prove ([24]) . it is enough to show that for every pair {x,z), such that A'o = 

2mEZ^, maxj Zj > 2 we have, if n is large enough 

¥iA\E{x, z)) < 2m-i/2n-i/2(]Ey)-i + 3c2„-i/2_ 

For each such x and z we may define a Markov chain M^y with state space Sxy 
consisting of all configurations of balls for which the event E{x, z) occurs. Let the 
distribution of the initial state, sq, be specified by the random assignment of the 
balls conditional on the event E(x, z). Then by symmetry sq is uniform over Sxy 
The transition probabilities of M^y are defined by a random walk in S^y as 
follows. Given that we are in state s choose two balls &i, 62 from the bin L indepen- 
dently without replacement, and choose another two balls &'i,&2 from all A^o balls 
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without replacement. Exchange the bins of bi and b'l, then exchange the bins of &2 
and 62. This gives a new state s' of M^y 

Note that if &2 7^ b[ then the bin L in the state s' contains both bahs b'^ and b'2. 
Furthermore, by symmetry ^gP{s' — > s) = X^sIP'l* for any state s' S iS^ry- 

Therefore the uniform distribution over 5^;^ is a stationary distribution for M^y 
So, denoting by the probabihty conditional on E{x, z), we have 

P,(A) < P,(&1 = &2) + P*(^^ n SL ^ 0); P,(6'i = &2) - ^; 

1 ^ iVQ 

and for all large enough n we have 

n 

P*(S'^, n S*^, 7^ 0) = J2 ^*(^^ ^^3^ ^)P{label{b[) = I, ?a6e?(6^) = j) 



<yA-^-.y.4k]"-^ (25) 



\ fc=l 







< \ ^4 = (26) 

< Sc^n-^/^ (27) 

Here (|25p follows since the conditional probability that a particular pair of balls 
lands in the same bin is by the union bound at most 

The bound ([26|l follows since on the event E{x, y), the events B and D occur, which 
imply, using ((23|) that for n large enough 



N§ ~ (1 - e)^ \mEN m'^ 



m 



since E 00 and e < 0.1. 

Finally, the bound ([77]) follows since on -B(x, y) the event D occurs. □ 

Proof of Theorem 14.21 The fact that Mono-Clique finds a clique of order 
u){G{n)) — Op{l) (which is a maximum monochromatic clique) follows by The- 
orem 11.21 and Lemma 14.31 

The running time of the algorithm is the sum of the number of steps to find the 
common neighbours for each edge and the number of steps to test if these common 
neighbours form a clique. 
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We assume that the elements in each hst in the adjacency hst structure are 
sorted in increasing order. Otherwise they can be sorted using any standard sorting 
algorithm in 0(^^gy time, where V = V{G{n)) and d{v) = dG(^„){v) is the 

degree of v in G{n). The intersection of two lists of lengths ki and ^2 can be found 
in 0{ki + ^2) time, so the total time for finding common neighbours is 



o[Y,d{vmv)-i) 

\v£V 



The time for checking if pairs of vertices are connected is a constant times the num- 
ber of 2-stars in G{n). So the time for testing cliques is again 0(X]t,Gy d{v){d{v) — 
1)). But 

E J2 d(v){d{v) - 1) ^ 0{n) 

vev 

by (|37)) in the proof of Lemma 11.51 □ 



5 Equivalence between set size and degree param- 
eters 

Proof of Lemma 11.41 We start by showing that if either E F or E Z3i converges 
and 

for some positive a = a(n) — > then 

EF = (ELii)!/^ +0(1). (28) 
By the inclusion-exclusion principle we have 



Xo X-l Xo \ ^ / ^ ^ , ^ 1 X X' 

^ J- ^ 1 ^ ml C C -L ChW V \ ^ ^ 



Z ■.= Qy [:illl±-lll^ \ <p(5in52 ^0|Xi,X2) < ^^i^. (29) 

Notice that EF = f7(l). This is clear if EF ^ y e (0;oo). Otherwise, EDi^ d 
and by ^ 

(n- l)(Er)2 

>EDi =d + o{l). 

n 

Now observe that by the first inequality of (|29|) 

EZ > EZIxiX2<am > (1 - a)rn"^EXiX2lxiX2<am 

> (1 - a)m-iEXiEX2 - m-^EX^X2lx^X2>ara (30) 
and since EF = 0(1) and Eyiy>(„„)i/2 -> we get 

EXiX2lxiX2>am < EX1X2 (lxi>(am)i/2 + I^s > (am)i/2 ) 

<2EXEXI;,>(,„)V2 



{m/ny/^EXEYIy^^,„y,2 = o{{EXf). 



26 



Combining the last estimate with (jSOj and ((29)) we get that 

m 

and p8)l fohows. This completes the proof of the first claim. 

Let e S (0; e) be smaller than eq from ^ or e' from (|3]), depending on which 
of the two conditions hold. To prove the second claim, we will show that for each 
6 <E (0; 1) and each sequence (i„, n = 1, 2, . . . ) with n^/^"'^ < i„ < n^/^"*"' we have 

P(r„ > t„) > + ,5))-"P(i?i(n) > tn) (31) 

and 

P(r„ > i„) < _ S))-"¥iD,{n) > U). (32) 

Proof of the lower bound hSl]) . We prove by contradiction. Suppose there is a 
subsequence (rife) and a sequence (5fe) with n^'^ l£ bk < ^H'^^'^ such that 

P(r(nfe)>6fc)<(di/2(l + 5))-"P(i?i(nfe)>6fe) for fc = l,2,... 



Let us define two sequences of positive numbers (t^) and (Ik) depending on whether 
(HD or (g]) holds. If © holds, then let tk = bk- If O holds, then let h = bu- In 
each case, define the other sequence by tk — (i^/^(l + 5/2)lk. Also write Yi(n) = 
{n/mf^Xi^n). Then 

P(i?i(nfc) > tk) = P(i^i(nfc) > tk,Yi{nk) > h) + P(i?i(nfe) > tfc, Yi(nfe) < ?,-)• 

(33) 

We claim that the first term on the right is bounded by 

{c + o{l))¥{Di{nk)>tk) as fc oo, 

where c = {jr+rY < 1- Indeed, if (g]) holds then 

nOiiuk) > tk,Yi{nk) > Ik) < nViiuk) > Ik) 
< {d^/\l + S))-'^F{Di{nk) > Ik) - cF{Di{nk) > tk). 

Similarly, if ([2]) holds then 

P{Di{nk) > tk,Yi{nk) > k) < P(i^i(rifc) > ^fe) 
^ ^ ^/2)"P(yi(nfe) > tk) < cFiDiiuk) > tk). 



Let us now consider the second term on the right of ([33]). It satisfies 

P{Di{nk) > tk,Yi{nk) < Ik) < P{Di{nk) > tk\Xi{nk) = Xk), (34) 

where Xk — [i'm{nk) /nk)'''^^lk\ . By ([36)) . conditionally on Xi{nk) — Xk, Di{nk) is 
stochastically dominated by the random variable Bk ~ Binom{nk — l,Pfc) where 
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= Sl^. Since tk = d^^^{l + S/2)lk - {l + 6/2)EBk, by the Chernoff bound ^ 
we have for large k 



PpiKO > tk\Xi{nk) = < P(Bfe > il + d/'S)EBk) < g-^^f^^^) = e-^^^'"'). 

Using the last estimate in ([Ml) , we finally have that for large fc the right hand side 
of p3l) is less than the left hand side, which is a contradiction. This finishes the 
proof of ([3T1) . 

Proof of the upper bound \32\) . Suppose, that there is a subsequence [rik) and 
a sequence (fo^) with l£ < ^i'^^'' such that 

Pl^-K) > > - 5))-"P(Z?i(nfc) > fc = 1, 2, . . . 

Again, define sequences of positive numbers (tfc), (Zfe), depending on what we want 
to prove. If ^ holds then let tj. = 6fc. If (U) holds, let = ^fe- Define the other 
sequence by setting = — 6/2)lk. We have that 

Ppi(nfc) > tk) > F{Y{nk) > Zfc)Ppi(nfc) > tk\Y{nk) > h)- (35) 

By the definition of [Ik) and {tk) and either ^ or (U), the first term on the right 
is at least (c + o(l))P(I?i(nfc) > tk) where c = (^x^) > 1- Therefore it suffices 
to show that the second term of (|35l) is 1 — o(l) to obtain a contradiction. 

Let Xk = [('Ti(nfc)/n/j)^/^?fcJ . Conditionally on Xi — Xk, Di{nk) has distribu- 
tion Binomirik — l,Pfc), where pk is the probability that a random subset of size 
Xk intersects a random subset of size X2. 

To estimate pfe, notice that (P^)) and (PH)) still hold when Xi and X2 have 
different distributions (P(Xi(nfe) = Xfc) = 1 and X2{nk) has distribution P{nk)). 
Also, with a(n) = (lnn)~^ we have 

ExkX2(nk)lxkX2>a(nk)rn(nt,) < XkE X (nk)lY (nk)>a(nk)nk /Ik =o(XfcEX(nfe)) 

by ©. Therefore by we get that 

EX{nk)xk EY{nk)lk 
Pfc -, — ^ (36) 

m(nk) Uk 

as k ^ 00. 

So tk — d^^'^{l — 5/2)lk ^ (1 — S/2)MBk, where Bk is a random variable with 
distribution Binom{nk — l,Pk)- Using ^ again, we get for k large enough 

¥iDi{nk) < tk\Yink) > h) < P(Sfe < (1 - S/3)EBk) = e-^^^^^") ^ 0. 

□ 

We now prove Lemma 11.51 Similar identities have been obtained in [3] , we 
include the proof for completeness. 

Proof of Lemma 11.51 The identity ([7]) follows from (pS)) since 
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It remains to show Note that EDi{Di — 1) — 2EVi, where Vi is the number of 
2-stars in G{n) centered at vertex 1. li A — A{n) is the event that 12, 13 € E{G{n)) 
then EVi = {^^{A). Conditional on sizes Xi,X2,X3 of the first three random 
subsets of G{n), the events 12 e E{G{n)) and 13 G E{G{n)) are independent. 
Furthermore, by symmetry, P(A|Xi, X2, Xg) = f{A\Si = [Xi],X2,X:i). By the 
inclusion-exclusion principle, for j = 2, 3 

X-\ X ^ I X-\ Xn \ ^ , „ / „ , N N , V X-\ Xj 



1-^^ <n^] ^ E{G{n))\Xi,X2,X3) < 

m J m 

So 

^, EX^X2X3 EY^iEY)^ , , 

Since P(A|Xi, X2, X3) is nonnegative 

P(A) >E (P(^|Xi,X2,X3)I^^<,„:/.I;,^<,„V2lx3<™V2) 

> (1 - e^)^E ^ — 3— I ^Ixi<e„ii/2lx2<emi/2ljf3<„„i/2^ 

> (1 - e^)^E ^ ^(1 - Ixi>emi/2 - Ijf2>„„i/2 - Ix3>emi/2)^ 

The bound in the last line follows since EF^ > (EF)^ = ^(1) a-nd by the uniform 
convergence assumption 

EX2l^^>^,„V2 - ^Ey2i^^^^,/2 =o(EX2) 

and also for j = 2, 3 

EXjI_^^>„„i/2 = m^/^n-i/^jgyj^^^^^^^ ^ o(EX). 

Now we have that EDx{Dx - 1) = Er2(Ey)2(l - o(l)) and dH]) follows by 0. □ 



6 Concluding remarks 

In this work we determined the order of the clique number in G(n, m, P) for a 
wide range of m — m{n) and P — P{n). We saw that in sparse power-law ran- 
dom intersection graphs with unbounded degree variance, the clustering property 
of G{n, m, P) has little influence in the formation of the maximum clique. This 
suggests that simpler models, such as the one in [5], may be preferable in the case 
of very heavy degree tails. However, when the degree variance is bounded, most 
random graph models, including the Erdos-Renyi graph and the model of [9! have 
only bounded size cliques whp. In contrast, we showed that in random intersection 
graphs the clique number can still diverge slowly. 
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We have a kind of "phase transition" as the tail index a for the random subset 
size (degree) varies, see Assume, for example that m = 6(n). When a < 2, 
the random graph G(n, m, P) whp contains cliques of only logarithmic size. When 
a > 2, it whp contains a 'giant' clique of polynomial size. But what happens when 
(HI) is satisfied with a = 2 but the degree variance is unbounded? 

We proposed a surprisingly simple algorithm for finding (almost) the largest 
clique in sparse random intersection graphs with finite degree variance. The perfor- 
mance of both Greedy-Clique and Mono-Clique algorithms can be of further 
interest, since these algorithms do not use the possibly hidden random subset struc- 
ture. How well would they perform on arbitrary sparse empirical networks? Can we 
suspect a hidden intersecting sets structure for networks where the Mono- Clique 
algorithm performs well? 

Another direction of possible future research would be to determine the asymp- 
totic clique number in dense random intersection graphs. For example, even in 
the random uniform hypergraph case where m = Q{n) and the random subset size 
X(n) — r2(n^/^) is deterministic, exact asymptotics of the clique number remain 
open. 

Acknowledgement. V. Kurauskas would like to thank prof. A. Iksanov for 
clarification of a few technical details. 
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